Regret Minimization for Online Buffering Problems Using the Weighted Majority Algorithm

نویسندگان

Sascha Geulen

Berthold Vöcking

Melanie Winkler

چکیده

Suppose a decision maker has to purchase a commodity over time with varying prices and demands. In particular, the price per unit might depend on the amount purchased and this price function might vary from step to step. The decision maker has a buffer of bounded size for storing units of the commodity that can be used to satisfy demands at later points in time. We seek for an algorithm deciding at which time to buy which amount of the commodity so as to minimize the cost. This kind of problem arises in many technological and economical settings like, e.g., battery management in hybrid cars and economical caching policies for mobile devices. A simplified but illustrative example is a frugal car driver thinking about at which occasion to buy which amount of gasoline. We study this problem within a regret analysis. In particular, we investigate the external regret obtained by the Weighted Majority Algorithm applied to our problem. We show that the algorithm does not achieve a reasonable regret bound if its random choices are independent from step to step, that is, the regret for T steps is Ω(T ). However, one can achieve regret O( √ T ) when introducing dependencies in order to reduce the number of changes between the chosen experts. If price functions satisfy a convexity condition then one can even derive a deterministic, fractional variant of this algorithm achieving the same regret bound. Our more detailed bounds on the regret depend on the buffer size and the number of available experts. The upper bounds are complemented by a matching lower bound on the best possible external regret. ∗Supported by the DFG GK/1298 “AlgoSyn” and UMIC Research Center

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Constrained Regret Minimization

Online learning constitutes a mathematical and compelling framework to analyze sequential decision making problems in adversarial environments. The learner repeatedly chooses an action, the environment responds with an outcome, and then the learner receives a reward for the played action. The goal of the learner is to maximize his total reward. However, there are situations in which, in additio...

متن کامل

A Multi-Objective Particle Swarm Optimization for Mixed-Model Assembly Line Balancing with Different Skilled Workers

This paper presents a multi-objective Particle Swarm Optimization (PSO) algorithm for worker assignment and mixed-model assembly line balancing problem when task times depend on the worker’s skill level. The objectives of this model are minimization of the number of stations (equivalent to the maximization of the weighted line efficiency), minimization of the weighted smoothness index and minim...

متن کامل

Blackwell Approachability and No-Regret Learning are Equivalent

We consider the celebrated Blackwell Approachability Theorem for two-player games with vector payoffs. Blackwell himself previously showed that the theorem implies the existence of a “noregret” algorithm for a simple online learning problem. We show that this relationship is in fact much stronger, that Blackwell’s result is equivalent to, in a very strong sense, the problem of regret minimizati...

متن کامل

Online Learning from Experts: Minimax Regret

In the last three lectures we have been discussing the online learning algorithms where we receive the instance x and then its label y for t = 1, ..., T . Specifically in the last lecture we talked about online learning from experts and online prediction. We saw many algorithms like Halving algorithm, Weighted Majority (WM) algorithm and lastly Weighted Majority Continuous (WMC) algorithm. We a...

متن کامل

Online Submodular Minimization for Combinatorial Structures

Most results for online decision problems with structured concepts, such as trees or cuts, assume linear costs. In many settings, however, nonlinear costs are more realistic. Owing to their non-separability, these lead to much harder optimization problems. Going beyond linearity, we address online approximation algorithms for structured concepts that allow the cost to be submodular, i.e., nonse...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Electronic Colloquium on Computational Complexity (ECCC)

دوره 17 شماره

صفحات -

تاریخ انتشار 2010

Regret Minimization for Online Buffering Problems Using the Weighted Majority Algorithm

نویسندگان

چکیده

منابع مشابه

Efficient Constrained Regret Minimization

A Multi-Objective Particle Swarm Optimization for Mixed-Model Assembly Line Balancing with Different Skilled Workers

Blackwell Approachability and No-Regret Learning are Equivalent

Online Learning from Experts: Minimax Regret

Online Submodular Minimization for Combinatorial Structures

عنوان ژورنال:

اشتراک گذاری